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(57) Abstract: The system and method for business systems transactions and infrastructure management includes a client side 
collection system and a server side management system having the c^ability to communicate with the other over a secure commu- 
nication path. The collection system extracts transaction and infrastructure data from a business system and transmits the extracted 
data to the management server over a secure corrununication path. The management server receives the extracted data over the se- 
cure communication path, compares it to threshold values, analyzes the data that exceed threshold values, and notifies a user of the 
results of the analysis. The management server also has the capability to activate corrective scripts in the client business system and 
to reconfigure the client business system. 
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SYSTEM AND METHOD FOR BUSINESS SYSTEMS TRANSACTIONS AND 
INFRASTRUCTURE MANAGEMENT 



5 

BACKGROUND 

This invention relates generally to ensuring business system performance, and 
more particularly, to monitoring, analyzing, and optimizing systems performance to 
achieve information technology goals and business objectives, 

10 In the developing years of business use of electronic systems, business systems 

were used primarily for accounting and records keeping functions. As these systems grew 
in capability and features, business managers began to make use of these capabilities and 
features in new ways to improve business performance. With the widespread acceptance 
and use of the Internet for conducting on-line commercial and consumer transactions, 

15 these business systems have become increasingly complex and geographically distributed. 
At the same time, there developed increasing demand for higher performance and 
increased reliability or "uptime" to satisfy these business needs. This has put greater 
emphasis and visibility on the role of the information technology (IT) infrastructure of e- 
conmierce businesses, and the people that support these systems. Concurrently, there has 

20 developed a trend whereby business managers have a need to play a more active role in 
IT infrastructure decision-making. 

Conducting business over the Internet has created many new challenges that are 
difficult to manage using conventional approaches. Companies with activities that rely on 
e-commerce stmggle to fmd solutions that will assist with managing increasingly 

25 complex infrastructure while satisfying a more demanding customer base. In particular, 

1 
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downtime costs can have a substantial impact on the gross revenues of an e-commerce 
organization, as well as losses due to brand erosion and customer dissatisfaction. As these 
companies become increasingly dependent upon e-Business as a significant source of 
revenue, the success of the overall business is inextricably linked to the health of the IT 
5 infrastructure. The lack of tools to communicate critical information concerning the 

condition of the IT infrastructure to business managers fiirther complicates this picture. A 
further complexity is that many e-commerce systems are widely distributed over a large 
geographic area, where a principle means of communications between parts of the system 
is via the Internet. 

10 Businesses are fiirther faced with the problem of translating IT organization 

performance goals into objectives that satisfy the needs of the business. While there is a 
need for a reliable, high performance infrastructure for executing business processes, 
there is often a lack of understanding of the impact that systems, applications, and process 
execution breakdowns have on business objectives, because of the inability to measure IT 

15 performance against business objectives. Regarding reliability and performance, existing 
management tools for heterogeneous and complex business processes offer incomplete 
and inadequate coverage of individual infrastructure elements. There is a lack of a 
systems management solution that encompasses the entire execution infrastructure as a 
single entity, capturing the interrelations between systems, applications, and business 

20 processes. A solution must be capable of automatically adapting to constant changes in 
this execution environment. 

It is often difficuh for IT organizations to relate the impact of process execution 
breakdowns to business objectives. Since infrastructure failures are viewed as isolated IT 
events, the impact on the business is noticed only when a product line manager or analyst 

25 sees a change in business performance. A solution is needed that will capture the business 
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process workflows and relate potential and actual infrastructure breakdowns to business 
process execution, enabling IT and business managers to find a collaborative solution. It 
is desirable to have a solution that is capable of expediting and automating the 
remediation process. 

5 IT contributions to the core business are often measured using IT-oriented metrics, 

rather than metrics that support IT contributions to business goals or corporate strategy. A 
solution is needed that reports IT performance in terms of business metrics rather than 
infrastmcture performance. By combining business and IT metrics, information may be 
generated that will enable business decisions to be made based on a correlation of IT 

10 performance and business goals. 

This environment has created a need for a unified solution for monitoring the 
health of real-time e-business infrastructures to improve the quality, reliability and total 
cost of ownership of e-business sites. This translates to the bottom line as greater 
customer satisfaction, a loyal customer base, and increased revenues. It is therefore an 

15 object of the present invention to provide a comprehensive solution for managing 
complex e-business infrastructures. 

Another object of the present invention is to enable improved performance of e- 
commerce systems by monitoring e-business transaction integrity, transaction 
perfomiance, and infrastructure reliability, with a focus on problem identification, 

20 diagnosis and analysis. 

Yet another object of the present invention is to collect data from e-commerce 
systems and transmit this collected information to a business systems transaction and 
infrastructure management system for evaluating, reporting, and managing e-commerce 
system configuration, providing real-time diagnostics, predictive analysis and impact 
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analysis, and by reducing downtime and detecting incipient failures before there is a 
significant impact of business performance. 



SUMMARY OF THE INVENTION 
5 The present invention satisfies the needs for transaction monitoring and 

infrastructure management of modem business systems by providing core technology for 
management of applications, databases, web servers, and operating systems. It also has 
the capability to manage the dynamic relationships between each of the major 
components of business systems by making use of a knowledge base and collected data to 

10 diagnose, analyze, and resolve problems in real time. The present invention provides 

predictability by ensuring reliability, repeatability, and optimal performance. It improves 
the total cost of ownership by reducing support costs, reducing time for problem 
resolution, and predicting and preventing potential problems. The business systems 
transaction and infrastructure management system makes use of light-weight data 

15 collectors embedded in a client system to extract in-depth information about e-business 
applications, web servers, databases, operating systems and hardware platforms in a 
transaction execution environment. This collected data is delivered to a management 
system server where it is recorded and analyzed. If a problem is identified, the relevant 
condition is diagnosed and corrective action is either recommended or automatically 

20 implemented. 

The management server contains powerful analysis engines that allow the 
software to quickly diagnose complex, cross-domain problems, analyze system 
performance, forecast potential failures, and recommend or automate corrective actions. 
The management system can also automatically reconfigure the client system to adapt to 

25 environment changes and provide remote notification for problems requiring immediate 
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attention. Key c^abilities include real-time diagnosis of cross-domain problems, 
performance analysis, predictive analysis, automated problem solving, remote 
notification, and automatic configuration capability. 

The business systems transaction and infrastructure management system improves 

5 the reliability, repeatability, and total cost of ownership of complex business 

environments. Use of this system results in optimized performance, increased reliability, 
and enhanced availability to increase customer satisfaction and retention. The 
management system minimizes or eliminates transaction failures, thereby reducing lost 
revenue, protecting brand image, and safeguarding credibility. Visibility is provided into 

10 all aspects of the business system environment, enabling optimal configuration control 
and facilitating troubleshooting. Root-cause analysis of cross-domain problems and 
automated/recommended corrective actions reduce support costs and enable proactive 
improvements to the transaction execution environment. Impact analysis improves 
resource prioritization and facilitates proactive response to problems affecting overall 

15 business. 

One embodiment of the present invention is a system for managing business 
systems transactions and infrastructure that comprises a collection system embedded in a 
client system, which includes one or more data collectors having at least one plug-in for 
extracting data from client system components, and a secure client connection to a 

20 commxmication network for transmitting the extracted data from the client aggregator and 
communicating over the network, and a management server, which includes a system 
manager for controlling the management server, a secure server connection to the 
communication network for communicating over the network and receiving the extracted 
data from the client collection system, the system manager for comparing the received 

25 extracted data with threshold values, a repository for storing a knowledge base and the 
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extracted data, a reasoning system for performing data analysis on the extracted data that 
exceed threshold values, and a notifications manager for notifying users of abnormal 
conditions based on the data analysis. The system may further comprise a client 
aggregator for receiving the extracted data fi-om the data collectors and a server 

5 aggregator for receiving the extracted data fi^om the secure connection. The system may 
further comprise a corrective actions manager for activating corrective action scripts in 
client systems over the communication network. The system may further comprise a 
graphical user interface server connected to the secure server connection to the 
communication network and a web browser graphical user interface connected to the 

10 secure client connection to the communication network for communication with the 

graphical user interface server. The collection system may further comprise one or more 
configuration clients for receiving configuration commands for configuring the client 
system, and the management server may further comprise a configuration manager for 
sending configuration commands to the collection system. The data collector plug-in may 

15 be selected fi-om the group consisting of operating system plug-in, database plug-in, 

business process plug-in, web server plug-in, and application plug-in. The secure server 
connection and the secure client connection to the communication network may be 
firewalls. The management server may further comprise a business process manager for 
analyzing and tracking client business system processes based on the extracted data. The 

20 management server may fiirther comprise a data processor for performing complex 

calculations. The management server reasoning system may comprise a diagnostic engine 
and a predictive analysis engine for analyzing the extracted data datasets and activating 
the notifications manager. The management server reasoning system may comprise a 
diagnostic engine and a predictive analysis engine for analyzing the extracted data 

25 datasets and activating the corrective actions manager. The repository may be an object 



6 



wo 02/079928 PCT/US02/09103 
oriented database management system. The repository may be a relational database 
management system. The system components from which data are extracted may be 
selected from the group consisting of a database host operating system, a database host 
database management system, a database host business process, a database host hardware 
5 components, a web host operating system, a web host web server, a web host business 
process, a web host hardware components, an application host operating system, an 
application host application programs, an application host business process, and an 
application host hardware components. The embedded collection system may fiirther 
include corrective scripts associated with the data collectors that are activated by a server 
10 command from the corrective actions manager. The communication network may be the 
Internet, 

Another embodiment of the present invention is a method for managing business 
systems transactions and infrastructure, which comprises the steps of extracting data from 
client system components by a data collector and data collector plug-ins, transmitting the 

15 extracted data through a secure client connection to a communication network, receiving 
the extracted data through a secure server connection to the communication network, 
comparing the extracted data by a system manager with threshold values stored in a 
repository, submitting the extracted data to a reasoning system if the extracted data 
exceeds a threshold value, analyzing the extracted data submitted to the reasoning system, 

20 and notifying a user affected by the results of the reasoning system analysis by a 

notification manager. The method may further comprise aggregating the extracted data by 
a client aggregator, transmitting the extracted data by the client aggregator, and receiving 
the extracted data by a server aggregator. The method may further comprise storing the 
received extracted data in the repository. The method may further comprise automatically 

25 activating a corrective script in the client system by a corrective actions manager to 
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correct a problem found by analyzing the extracted data The method may fiirther 
comprise manually activating a corrective script in the client system by a corrective 
actions manager to correct a problem found by analyzing the extracted data. The 
analyzing step may comprise detecting a problem from the extracted data by a diagnostic 
5 analysis engine and associating the detected problem with a recommended solution found 
in a knowledge base stored in the repository. The analyzing step may comprise collecting 
the extracted data over time, detecting a trend in the collected extracted data by a 
predictive analysis engine, and estimating a time duration for a failure condition to occur. 
The notifying step may comprise accessing a knowledge base in the repository to 

10 determine a user affected by the results of the reasoning system analysis, sending the 
results of the reasoning system analysis to a graphical user interface server, and 
transmitting the analysis results over the communication network to a client web browser 
graphical user interface for presentation to the affected user. The notifying step may be 
selected from the group consisting of transmitting an email message, sending a numeric 

15 page, and transmitting a text page. The method may fiirther comprising transmitting a 
client system configuration change request from a client web browser graphical user 
interface through the secure client connection to the communication network, receiving 
the configuration change request by a configuration server through the secure server 
connection to the communication network, communicating the configuration change 

20 request to a configuration manager from the graphical user mterface server, validating the 
configuration change request and conmiunicating a configuration change order to the 
gr^hical user interface server by the configuration manager, transmitting the 
configuration change order from the graphical user interface server through the secure 
server connection to the communicafion network, and receiving the configuration change 

25 order by a configuration client through the secure client connection to the communication 
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network. A computer-readable medium may contain instructions for controlling a 
computer system to carry out the steps of this embodiment of the invention. 

Yet another embodiment of the present invention is a method for managing 
business systems transactions and infrastructure, which comprises the steps of receiving 
5 extracted transactions and infrastructure data from a business system, comparing the 
extracted data with threshold values stored in memory, analyzing the extracted data that 
exceeds threshold values, and notifying a user of a result of the analyzing step. The 
method may further comprise activating a corrective script based on a result of the 
analyzing step. The method of may further comprise sending configuration commands to 

10 the business system. The method may further comprise sending notifying information to a 
graphical user interface. A computer-readable medium may contain instructions for 
controlling a computer system to carry out the steps of this embodiment. 

Yet another embodiment of the present invention is a method for managing 
business systems transactions and infrastructure, which comprises the steps of extracting 

15 transactions and infrastructure data from a business system and sending the extracted data 
to a system for comparing with thresholds, analyzing extracted data that exceeds 
thresholds, and notifying a user of a result of the analyzing step. The method may further 
comprise activating corrective scripts in the business system. The method may further 
comprise reconfiguring the business system upon receipt of a reconfiguration command. 

20 The method may further comprise receiving notification information and presenting the 
information on a graphical user interface. A computer-readable medium may contain 
instructions for controlling a computer system to carry out the steps of this method. 
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BRIEF DESCRIPTION OF THE DRAAVINGS 
These and other features, aspects, and advantages of the present invention will 
become understood with regard to the following description, appended clainas, and 
accompanying drawings where: 
5 FIG. 1 shows a representation of a business process execution environment; 

FIG. 2 shows the relationship between a client subsystem and a management 
system server; 

FIG. 3 shows an architecture of a collecting system that resides in one or more 
client host business systems; 
10 FIG. 4 shows an architecture of a management system server; 

FIG. 5 shows a flow diagram for a data collection process; 
FIG, 6 shows a flow diagram for a data analysis process; and 
FIG. 7 shows a flow diagram for a corrective action process. 

15 DETAILED DESCRIPTION 

Turning now to FIG. 1, Fig. 1 shows a representation 10 of a business process 
execution environment of the present invention. This is the environment in which a 
business management system must function. An IT strategy 102 and a business strategy 
106 are derived from an enterprise strategy 104. From the IT strategy 102, tools, 

20 applications, and resources 108 are determined that are necessary to implement the IT 
strategy 102. Similarly, business process and business rules 1 10 are formulated to meet 
the business strategy 106. A management system is then defined, modeled, and executed 
to meet the IT strategy 102 and the business strategy 106. The IT organization must then 
monitor, analyze and optimize the results 1 1 8 by determining IT metrics 114 and business 

25 metrics 116. These metrics 114, 1 18 are then fed back to the definition and execution 1 12 
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of the management system to further improve performance. To determine whether 
business unit objectives are being met, IT organizations must understand the impact of 
systems, applications, and process failures on the business objectives. The present 
management system models and monitors an organization's business processes and 
5 infrastructure components to capture business process execution 112. Monitoring, 
analyzing and optimizing the execution of the business process ensure that IT 
achievements meet business unit objectives. Knowledge of the relationship between 
business processes and infrastructure components helps to identify compromised business 
objectives. By determining the dynamic path of execution for business processes, 

10 organizations can ensure an optimized workflow and healthy execution environment. The 
system monitors the business process transactions and the behavior of the infrastructure 
components for detecting execution failures and diagnosing the cause of such failures. 

To monitor the execution of a business process, the system must provide insight 
into the health of the system from the perspective of the business process. The system 

15 must analyze factors such as the number of executions, the length of executions, the 

presence of failed executions and their causes, and the consequences of the system failure 
on the business process. Process monitoring involves tracking the execution of business 
processes across applications and FT components, and collecting status information at 
each execution step, such as current order state, database data volume, total number of 

20 orders processed, and number of orders fulfilled and orders not fulfilled. Monitoring a 
business process involves tracking various business processes such as web page 
generation, online purchasing, order notification, and order management; using 
predefined workflow models that encapsulate the business process hierarchy and its 
relationship with the application and infirastmcture components that comprise the process 

25 execution environment; tracking the execution of business processes using metrics from 
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collectors; gathering, correlating, calculating, and aggregating execution data into 
workload data that is analyzed to determine the state of the execution process; detecting 
process execution failures and process execution bottlenecks; and aggregating various 
business-oriented metrics such as the number of business transactions executed over time, 

5 data volume by transaction type, number of process requests, etc. In addition, to monitor 
the execution of a business process, a system first models the target business process and 
breaks it into a sequence of tasks. It then identifies the set of application or infrastructure 
components on which the task relies in order to properly execute. These components 
identify the key performance indicators that are used to track and analyze the execution 

10 environment. Monitoring and diagnosis is facilitated in this low level data, resulting in 
three levels of information: a system-wide view of the busmess workload and 
infrastructure resource consumption; detailed view of failed executions of business 
processes; and historic view of the past business workload and system behavior. 

Turning now to FIG. 2, FIG. 2 shows the relationship 20 between a client 

15 subsystem 202 and a management system server 206 connected via the Internet 204. 
The client subsystem 202 and the management system server 206 comprise elements of 
the present invention, a system for business systems transactions and infrastructure 
management. The client subsystem 202 includes data collectors for collecting data from 
a client's business system components, a configuration client for initiating and altering 

20 a configuration of the client's business system components, a web browser Graphical 
User Interface (GUI) for controlling the configuration client, an client side aggregator 
for collecting and routing data, enabling the data to pass through firewalls, and a client 
side firewall that connects to ttie Internet 204 for communicating with the management 
system server 206. The data collectors extract data from every application, server, and 

25 database at every step of the execution process, which is analyzed and correlated by the 
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■ management system server 206 to detect execution failures. The management system 

i 

! 

! server 206 also connects to the Internet 204 by way of a server side firewall and a 

i 

server side aggregator, for communicating with the client subsystem 202. The 

I 

I management system server 206 includes a reasoning system for diagnosing and 

! 5 predicting failures, means for notifying of a fault and for taking corrective action, a 

I 

configuration manager for maintaining the configuration of the client's business system, 
a business process manager and a system manager, a data repository, and a GUI server 
to handle the client subsystem configuration client. The management server 206 
captures and represents expert knowledge about the system processes and infrastructure 

i 

10 components. Through monitoring and analysis, the management system server 206 is 
' able to detect and predict problems with performance and resource availability. Using 
an expert knowledge base, the system may recommend recovery actions for both 
inunediate resolution and long-term problem prevention. It provides dynamic system 
analysis and diagnosis for identifying complex system problems involving multiple 
1 15 faults and symptoms across multiple enterprise components. 

■ Taken together, the client subsystem 202 and the management system server 206 
combine infrastructure management with business process visibility to manage the 
business transaction environment as a single entity. The management system assesses 

^ the state of the business and determines what conditions are affecting it based on 
20 collected data such as transaction execution information, including but not limited to 
user navigation on the site, transactions being executed, transactions not resulting in 
sales, etc. The system allows IT organizations to evaluate information against the 
business process to fully understand the business impact of IT events. 

Turning now to FIG. 3, FIG. 3 shows an architecture of a system client 30 that 
25 resides in one or more client host business systems. The system client comprises a 
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collection system, a web browser based graphical user interface, and configuration 
clients. It works as an integral part of a business systems transactions and infrastructure 
management system that includes a management server. An embodiment of a 
management server for use with a collection system is described subsequently and 

5 shovm in FIG. 4. The collection system uses lightweight data collectors 318 to retrieve 
information from business application hosts 306, server hosts 304, database hosts 302 
and other components of a business system. All collected data is sent to a transactions 
and infrastructure management server, such as the embodiment shown in FIG. 4, where 
it is stored in a repository. The stored data is used to track client business systems 

10 processes, monitor performance, and predict and diagnose problems. The data 

extractors on the client hosts 302, 304, 306 are called data collectors 318. There is at 
least one collector for each host in the client system 302, 304, 306. Each collector 318 
contains specialized plug-ins that are individually enabled by a client for each host 302, 
304, 306. Collector plug-ins collect data for specific client system components. The 

15 plug-ins include application plug-ins 314, database plug-ins 310, server plug-ins 312, 
operating system plug-ins 308, and business process plug-ins 316. Each collector 318 
contains all of the available plug-ins for a business system environment. After a 
collector is installed on a host, appropriate plug-ins must be enabled for the host. For 
example, if a web server 304 is running on a host, the web server plug-in 312 and the 

20 operating system plug-in 308 must be enabled for that host 304. The web server plug- 
in 312 captures metrics that reflect the status of the web server, and the operating 
system plug-in captures metrics for the host system 304. If multiple instances of an 
application or database are running on a host, a collector plug-in may be configured to 
monitor each instance. Monitoring additional instances does not require additional plug- 

25 ins, since only one plug-in is required to collect data from multiple domain instances. 
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Client aggregators 320 receive extracted data from various collector plug-ins 
installed on host machines, and sends the data to a management server aggregator, as 
shown in FIG. 4. The client aggregator 320 opens a socket in a firewall 326 and pipes 
the data through in a single stream to a system manager located on a management 

5 server as shown in FIG. 4, where it is used to track the health and performance of the 
client business system as well as execution of the business process. To ensure security, 
Secure Socket Layer (SSL) technology and data encryption techniques are used to 
transmit the collected data. In addition, data is transmitted via a proprietary TCP/IP- 
based communication protocol. To maintain a high level of efficiency and reduce 

10 network traffic, collector plug-ins only transmits data to the system manager when the 
values of the datapoints change. If a datapoint's value has not changed since the last 
collection interval, the collector plug-in does not send any data. If a datapoint's value 
has changed since the last collection interval, the collector plug-in sends a new value. 
The system manager makes the necessary calculations to determine if a datapoint value 

15 has exceeded a threshold and therefore requires analysis. Collector plug-ins perform 
some simple calculations on incoming datapoints, such as summation, averaging, etc. 
More complex calculations are performed in the management system server. Table 1 
shows some typical types of data that is collected by a web server plug-in 312. 



Information Collected 


Example 


Web server configuration 


Server address, server port, number of http processes. 


Web server performance 


404 errors, response time, thread usage, memory 
usage. 


Web page requests 


Web page requests per hour, day, week, or year. 


URI errors 


Error count per URI. 
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Information Collected 


Example 


Persistent connections 


Number of persistent connection requests or number 
of persistent connection failures. 


Web server cache 


Current cache size or ratio of cache hits to the number 
of look-up requests. 


Web server threads 


Number of active or idle threads. 


Security 


Is security active? 



TABLE 1 



A database plug-in 310 monitors instances of a database host. One database 
plug-in is capable of monitoring multiple instances of databases. Table 2 shows some 
typical types of information that the database plug-in 310 collects. 



Information Collected 


Example 


Sessions 


Number of current and active sessions. 


Tablespaces and tables 


Tablespace status, free space, table size, extents. 


Database files 


Tablespace name, file size, number of blocks. 


Database server 


Rimning processes, CPU and memory usage. 


Listener and instance manager 


Listener process name and status. 


Disk-intensive SQL and cursor 
usage 


Average time for SQL command, number of open 
cursors. 


Rollback transactions and 
contentions 


Number of GETS and consistent changes. 


Library cache 


Hit ratio for library cache. 



TABLE 2 
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An operating system plug-in 308 collects data from the host operating system. 
Table 3 shows some typical types of information that the operating system plug-in 308 
collects. 



Information Collected 


Example 


Network 


Number of incoming and outgoing packets. 


System processor 


Percentage of processor utilization or idle time. 


Svstern nrocesses 


Number of active processes, percent of CPU 
utilization by processes, number of page faults per 
second. 


Physical memory 


Amoimt of available and used memory. 


Virtual memory 


Amount of available and used virtual memory, 
number of page read and writes per second. 


System cache 


Nimiber of cache hits. 


File system 


Amount of used and free space, file system status. 


Directory changes 


Number of new files in a directory. 


Configuration 


IP address, DNS address, default gateway. 


TCP/IP usage 


Number of current and reset connections, number of 
failed connection attempts. 



5 TABLE 3 

Application plug-ins 314 extract low-level and mid-level data fliat concern a vendor 
application. They collect information that is specific to a particular application. 
Business process plug-ins 316 extract high level data concerning a vendor application, 
as well as related types of data concerning the vendor business process. A business 



10 process manager in the management server uses this data to determine the effectiveness 
and efficiency of the business process. 
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The business systems transaction and infrastructure management system of the 
present invention provides a graphical user interface (GUI) 324 that runs on a web 
browser, allowing users to connect to the management server of the system through an 
intranet or the Internet. This interface provides different views of all elements that are 

5 involved in the execution of the business process, and displays system status 

information, business transaction execution information, and real-time diagnostics. 
Detailed information is available for every application, database, and operating system 
platform that is involved in the business process. The GUI 324 also provides a listing of 
all notifications that result from problems that the various engines in the management 

10 server report. 

The system client also includes a configuration client 322 that maintains the 
current configuration of the host system 302, 304, 306. The configuration data used by 
the configuration client may be either hard coded into the client system, or may be 
downloaded fi:om the management server. 

15 Turning now to FIG. 4, FIG. 4 shows an architecture of a management system 

server 40. The management server 40 logically groups software that combines, 
correlates, and analyzes data that the collections system described above extracts form 
the client business system. The management server consists of several server processes 
that perform the central functions of the product. These server processes provide back- 

20 end processing for model instantiation, system status evaluation, GUI display 

generation, report generation, configuration management, real-time diagnostics, and 
predictive analysis. The management server comprises a system manager 424, a 
business process manager 426 if required, and a configuration manager 428 connected 
to the repository 418, a system conmiunication bus 430, and a GUI server 406. A data 

25 repository 418 connects to the system manager 424, the business process manager 426 , 



18 



wo 02/079928 PCT/US02/09103 

the configuration manager 428, a data processor 420 and the system communication bus 
430. At least one aggregator 404 comiects to a firewall 402 and the system 
communication bus 430. The data processor 420 connects to the repository 418. A 
reasoning system 408, a notifications processor 414, and a corrective actions processor 

5 416 connect to the system communication bus 430. The GUI server 406 connects 
between the firewall 402 and the system manager 424, business process manager 426 
and the configuration manager 428. The firewall 402 connects to the Internet. 

The system manager 424 is the central system controller that understands the 
communication and data exchange protocols. During configuration, the system manager 

10 424 creates objects in the repository 418 based on the customization of a target business 
system. During runtime, the system manager 424 can create new objects, or instantiate 
those it has already defined. The system manager 424 is responsible for modeling data, 
generating patterns, recognizing datapoints that trigger analysis, and maintaining 
relationships between data extracted from system components. It receives data about 

15 collector installation, domain configuration, and datapoints, and processes data in a 

queue on a first-come, first-serve basis. The system manager 424 creates new domains, 
components, and datapoints in a database; modifies strings of existing domains, 
conq)onents, and datapoints; creates and maintains relationships between objects in the 
database; inserts new datapoint values into the database; determines whether the current 

20 datapoint value has met or exceeded a threshold; addresses errors that are generated in 
the data insertion phase; addresses communication messages by sending results to the 
client components, updating the GUI with changes, and communicating with the client 
using proprietary protocols; builds datasets by maintaining a snapshot of the client 
environment and maintaining relationships between components; and normalizes raw 

25 data for analysis and converts data into a map value. Initialization procedures for the 
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management server are initiated when the system manager 424 starts a single monitor 
process. All other server processes are then started. Once the system manager 424 is 
started, it tracks each process and restarts any processes that go down. 

The business systems transactions and infrastructure management system 

5 described herein is a complete, object-oriented system. Data objects that represent 
business process workflows, business applications, databases, and operating systems 
are stored in an Object-Oriented Database Management System (OODBMS), Using 
XML representations, the system manager 424 and other server components create and 
instantiate these objects in the repository 418 at initialization. Run-time data such as 

10 client configuration data and client system status information are also stored in the 
repository 418. The management system contains an efficient data model that allows 
multiple engines to use the same data. The repository 418 is the central store for 
configuration data, collected data, the knowledge base, the business process, 
component models, and relationship models. The repository 418 also contains a 

15 knowledge base that encapsulates the expertise of business analysts, system 

administrators, and application administrators. The reasoning system 408 uses the 
decision tables in the knowledge base to identify complex problems that involve 
multiple applications and components. 

The management server 40 contains one or more aggregators 404 that act as a 

20 hub and a router. Various sources send their data through the aggregator 404, enabling 
the data to pass through firewalls 402 to other destinations in the system. Aggregators 
404 perform various functions, including maintaining system security when passing 
data through firewalls 402, transmitting extracted data from a collection system 
described above to the system manager 424 and other management server components, 

25 relaying messages and instructions from the management server components to the 
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collection system, and monitoring connectivity of both the client and server components 
of the management system, including notifying the system manager 424 of connectivity 
lapses. The location and configuration of aggregators 404 depends on the number of 
firewalls 402. At a minimum, a system requires two aggregators 404, one on each side 
5 of a firewall 402. One is the client aggregator in the collection system (see 320 in FIG. 
3) and the other is the management server aggregator 404 shown in FIG. 4. 
Aggregators 404 preserve firewall security by piping all data through a single, full 
duplex socket, using Secure Sockets Layer (SSL) and data encryption. Aggregators 404 
can accept data from muhiple sources. The aggregator 404 briefly caches the data, and 
10 pipes it in a single stream through one dedicated socket in a firewall 402. On the other 
side of a firewall, a second aggregator receives the data and forwards it to the 
appropriate target destination, which may be a single or multiple destinations. The 
management server aggregator 404 passes messages and instructions from the server 
conq)onents to the collection system. For exanQ)le, when there is a need to change the 
15 intervals for collecting web server (312 in FIG. 3) data in a client collection system 
using a configuration client (322 in FIG. 3), the configuration manager 428 sends 
instructions to the management server aggregator 404, the aggregator 404 relays the 
interval instructions through the firewall 402, the client aggregator (320 in FIG. 3) 
accepts the instructions and notifies the appropriate collector (318 in FIG. 3) which 
20 notifies the web server plug-in (312 in FIG. 3), and the plug-in initiates the interval 
change. Both client and server aggregators monitor socket connectivity and host 
availability for all server components. Each aggregator monitors the connections 
through which it receives its data. The management server aggregator 404 monitors 
socket connectivity of all server components such as the system manager 424, 



21 



wo 02/079928 PCT/US02/09103 

configuration manager 428, and reasoning system 408, and socket connectivity with 
client aggregators. 

The data processor 420 performs calculations on raw data received from the 
collection system. It provides statistics on the extracted data and performs complex 
processing such as calculating datapoints that span collection cycles. For example, the 
data processor 420 computes the daily average or the standard deviation on the 
collected data. It also generates datapoints and stores them in the repository 420. 

The configuration manager 428 manages changes in business system 
environments as businesses change business processes and their execution 
infrastructures. When a collector in a collection subsystem detects changes in a business 
system configuration, or when the configuration manager 428 is used to make 
configuration changes to a business system, the configuration manager 428 updates 
object models in the repository 418. In this manner, the management system can 
reconfigure itself to adapt to a new environment. The configuration manager 428 is 
responsible for the configuration of the entire management system. The configuration 
manager 428 makes use of a GUI that is used to fine-tune the management system client 
and server components. A web browser GUI (324 in FIG. 3) for use with the 
configuration manager 428 is shown connected to the firewall in flie collection system, 
although it may reside anywhere that provides a connection to the Internet. This GUI 
enables a user to configure individual collector plug-ins, establish data collection 
intervals, create corrective actions, execute semi-automated corrective actions, and 
view analysis results and corrective actions executed. The web browser GUI 
communicates with the configuration manager 428, which communicates with 
configuration clients in the host system. 
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The business process manager 426 interacts with the system manager 424 to 
calculate and aggregate workload and performance statistics, track foiled executions of 
business processes, analyze business impact, and display current results. Using 
predefined knowledge of the business environment and the relationships between 
5 process definitions and the underlying infrastructure components, the business process 
manager can quickly detect failures in the business process and their causes. 

The reasoning system 408 analyzes data collected by the collection system to 
determine the health of a business system. The reasoning system 408 consists of an 
extensive knowledge base of problems and associated reconmiendations, and analysis 
10 engines that detect and predict problems. A diagnostic engine 410 is used to detect and 
analyze faults, and a predictive analysis engine 412 is used to identify potential 
problems before they result in faults. 

The management server 40 is able to inform users when conditions change that 
may lead to a problem through a notification manager 414. In addition to viewing status 
15 messages in a GUI and configuring the notification manager 414 to send email, numeric 
pager, and text pager notifications when changes or problems occur, a user may 
configure the corrective actions manager 416 to resolve a problem or condition 
automatically or on demand. 

The management server contains a GUI server 406 to provide an interface with 
20 the transactions and infi-astructure management system. The GUI server, which runs on 
a web browser, uses XML source for its pages. When a GUI page is requested, the 
GUI server 406 obtains the most current data from the repository and constructs the 
requested page. 

Timiing now to FIG. 5, FIG. 5 shows a flow diagram 50 for a data collection 
25 process. At the time of installation of the management system, a client collector system 
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is installed on a client system, as shown in FIG. 3. Data collectors provide services to 
other components of the management system by using appropriate plug-ins to extract 
detailed information about the relevant business environment. Data collectors and 
appropriate plug-ins are installed 502, and collection intervals are configured for each 

5 plug-in 504 on each host. When the host is started, the configured plug-ins are loaded 
into memory and each plug-in collects data at the rate that was specified at 
configuration time. Any data collection errors are logged 506 and corrective action is 
executed 508. The data collectors communicate with a system manager, configuration 
manager, aggregator and the network 510 in order to execute its fiinctions. The plug- 

10 ins are started and data transmission security is enforced 512, and mew datapoint values 
are sent to the system manager by the data collector only when there if a change in 
value 514. If no change in datapoint value has occurred, no data is sent, reducing the 
volume of network traffic. Configuration changes are implemented to the host system 
configuration using the configuration client 516. Collected data is transmitted via an 

15 aggregator to the system manager 518, which stores it in a repository for determination 
of whether analysis of the data is required. If required, the system manager submits the 
appropriate dataset to the reasoning system for analysis. The process continues 520 
until the execution is terminated 522. 



20 Data Objects 

The transactions and infi-astructure management system is an object-oriented 
system. The system manager and other server elements create and instantiate data 
objects representing business process workflows, business applications, databases, and 
operating system resources that are stored in an Object-Oriented Database Management 

25 System (OODBMS), The database may also be a relational database. Using XML 



24 



wo 02/079928 PCT/US02/09103 

representations, the system manager and other server components create and instantiate 
these objects in a repository at time of installation. Run-time data such as client 
configuration data and client system status information are also stored in the repository. 
The system uses a data model that is very efficient and allows multiple processes to use 

5 the same data. 

The system captures the definition of business processes through workflow 
models that encapsulate every step of their execution. It also captures the characteristics 
of applications, systems, and databases in object models representmg each element of 
the execution mfrastructure. Inter-dependencies among business processes, 

10 applications, databases, and operating platforms are captured in relationship object 
model that bind the business process and the entire execution infrastructure. These 
object models are mstantiated at runtime and stored in the repository as representations 
of elements of business process infrastructure referred to as data objects. Key data 
objects include domain, instance, component, datapoint, dataset, and relationships. 

15 A domain is the highest level class that is managed by the system. Domains are 

associated with a web server, operating system, database and business application. 
Domains group related components. 

A component is part of a domain whose data is being tracked. For example, a 
file system, system process, and log file are all components. Components are modeled 

20 as a group of data points belonging to the same area, performing the same function, or 
sharing conmion attributes. Each domain contains one or more components, and a 
component has a type and one or more instances. The type represents the overall 
category of the component object, while an instance consists of a specific case within 
that category. For example, a file system is a type of component on a server, while a 
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file system partition is an instance if a file system component. An instance is defined as 
a specific case or example of a component. 

In a particular domain, one component may be a file system. An instance of the 
file system component is a particular file system partition. Multiple instances of a 
5 component may exist. A second instance of the file system may be a second file system 
partition. 

Datapoints are the monitored characteristics of components and component 
instances. File system datapoints might include total disk space, amount of firee space, 
and amount of used space on the disk. Associated with each datapoint is a unique 

10 definable metric, or unit of measure whose value identifies a specific aspect or 

characteristic of the component. The collector plug-ins extract the values of datapoints 
firom the host machines. Datapoints typically store real values. For example, CPU 
utilization is expressed as a percentage. 

Datasets are logical collections of datapoints stored within the repository. The 

15 datapoints in each dataset can belong to any number of components, and any number of 
datasets can contain the same datapoint. The groupings are based on various types of 
relationships between datapoints. For exan5)le, since database function is affected when 
a file system runs low on space, a dataset might group datapoints that monitor file 
system space within the operating system domain, plus datapoints that monitor database 

20 activity in the database domain. Each dataset represents an attribute that the reasoning 
system uses for analysis. In order to compare the states of several datapoints without 
knowing the actual values of the datapoints, datasets use normalized values. Dataset 
definitions are created during the knowledge capture phase prior to product distribution. 
At run time, the system manager creates instances of dataset definitions. These dataset 

25 instances are used by the reasoning system for analysis. 
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Relationships connect two or more things as being, belonging, or working 
together. Components often have relationships with one another. For example, a word 
processing application may depend on a printer connected to a particular host in order 
to print, and therefore establishmg a relationship. The relationship between components 
5 is very important when creating individual datasets. When building dataset instances, 
the server looks for these relationships. For example, if the printer and spooler are 
related to each other and there is a dataset definition that contains datapoints for both, a 
dataset instance will combine only the datapoints from the related printer and spooler. 
This is crucial to the problem determination that is based on this relation. If the printer 

10 is inoperative, only the spooler that is associated with that printer is affected. The 
system manager instantiates datasets based on relationships. As the system manager 
collects, stores, and analyzes data from a system, the management server checks for 
relationships that exist between the various elements of a business enterprise. 
Relationships are the dependencies, interactions, and working associations among 

15 domains, components, component instances, and datapoints that are being monitored. 



Analysis, Diagnosis, and Recovery 
The transactions and infrastructure management system uses a collection system, 
depicted in FIG. 3, to collect and store data about a business system environment. The 
20 reasoning system (408 in FIG. 4) then analyzes the data to determine the health of the 
business system. The reasoning system consists of an extensive knowledge base of 
problems and associated recommendations, and analysis engines that detect and predict 
problems. Once the data is analyzed, if a problem or unusual condition is present, 
notifications may inform a user or group about tat problem or condition. A corrective 
25 action script may automatically resolve the problem. These features help to ensure that 
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a business system remains available and productive to meet the needs of the business. 
The reasoning system performs data analysis and provides two main functions. The 
diagnostic engine (410 in FIG, 4) uses collected data to ascertain the probable cause of 
anomalies in a business system. Once a cause is identified, the diagnostic engine 

5 provides advice for resolving the problem or condition. Notifications may be sent to a 
user, or corrective action may be automatically activated. A predictive analysis engine 
(412 in FIG. 4) uses collected data to determine trends in a business system, allowing it 
to forecast performance loads and system failures. 

Tuming now to FIG. 6, FIG. 6 shows a flow diagram 60 for a data analysis 

10 process. The collector plug-ins gather data 602 from each host and domain in a host 
business system, and the system manager receives, organizes, and stores the collected 
data in a repository 604 as datapoints and datasets. As it receives the data, the system 
manager monitors the values of the collected datapoints to determine if a threshold has 
been exceeded 606. If a datapoint exceeds a threshold 608, the system manager submits 

15 the dataset to which the datapoint belongs to a queue for a reasoning system 6 10. If a 
datapoint does not exceed a threshold 608, the datapoint is stored in the repository 620 
and the data gathering step 602 is repeated. This analysis occurs quickly. A slight delay 
incurred by submitting the analysis request to a queue allows the data to stabilize, making 
it difficult to trace a source of the problem. The reasoning system detennines what type of 

20 analysis is needed and submits the dataset to either a diagnostic engine or a predictive 

analysis engme 612. If submitted to the diagnostic analysis engine, the diagnostic analysis 
engine analyzes the problem or condition and determines probable cause 616. If 
submitted to the predictive analysis engine, the predictive analysis engine views the data 
over a period of time to detect trends 614. If one of the analysis engines discovers a 

25 problem or trend, it generates an analysis result and a notification, for example, either a 
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probable cause of a problem notification or an estimated time to failure or impact 
notification, which is provided to a user 618. Notifications may be by email, numeric 
page, or text page to an appropriate individual or group of individuals. If corrective action 
has been selected for the particular problem or condition, a corrective script will run 
5 automatically or by user initiation on the host that is experiencing the problem 618. The 
reasoning system diagnoses problems and conditions, and makes predictions based on a 
threshold value for datapoints. For example, if a datapoint measure CPU utilization, the 
threshold determines whether CPU utilization is low (good), medium (fair), high 
(warning), or very high (critical). This distinction is important in data analysis because it 

10 determines how the condition is diagnosed. When a reasoning system analyzes a dataset, 
it may not generate an analysis result. It only generates an analysis result when a data 
pattern matches information in a knowledge base with a high degree of certainty. 

The diagnostic engine can identify patterns in collected data, which allow it to 
determine the state of the system or a problem that may be present. To accomplish this, 

15 the system manager organizes the data into datasets, each of which contain specific 

datapoints. The system manager stores datasets in a repository. A datapoint describes a 
specific attribute of a component, such as CPU utilization and available disk space. A 
dataset is a group of datapoints with a certain value range. For example, a dataset may 
contain a datapoint for CPU utilization that has a "warning value, indicating that the CPU 

20 is almost fiiUy utilized. When the reasoning system submits a dataset to the diagnostic 
engine for analysis, the diagnostic engine retrieves the dataset from the repository, 
examines the patterns in the data, and matches these patterns with information in its 
knowledge base that best describes the current state, potential problems, or existing 
problems. If the diagnostic engine finds an anomaly, it determines the probable cause or 

25 condition and generates an analysis result that may be viewed using a GUI. Each analysis 
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result describes the problem or condition, its severity, the date and time of occurrence, 
and short and long-term solutions, if applicable. Problems or conditions can trigger 
notifications to individuals or groups of individuals, and corrective action script to 
remedy the problem or condition. 
5 The predictive analysis engine analyzes collected data over time to discover trends 

in a host business system. Predictive analysis discovers trends in degrading performance 
and potential system failures. When the predictive analysis engine diagnosis a problem or 
condition, it analyzes datapoint values over time to determine trends in system resources 
and business processes. If the predictive analysis engine detects a trend, it produces and 

10 sends an analysis result to a GUI. The analysis result may be a graph that displays the 
values of a datapoint or datapoints over time. Predictive analysis estimates the time 
available until resources are depleted or until a failure occurs, warning the user in 
advance to enable correction of the problem or condition. 

W^en the reasoning system diagnoses an anomaly and reports the analysis result, 

15 the notification manager automatically sends an email, a numeric page, or a text page to a 
user or group of users. A knowledge base defines the affected group. Group members, 
email addresses and paging destinations must be configured. Notifications may be 
configured and customized. Only configured groups are notified. After groups and 
destinations are configured, the reasoning system analyzes the data and determines the 

20 condition or problem that caused it, generates an analysis result, and the notification 
manager notifies the impacted group. 

Turning now to FIG. 7, FIG. 7 shows a flow diagram 70 for a corrective action 
process. This process is invoked when abnormal conditions or problems occur, and can be 
configured to resolve the problem or condition automatically or on demand. Corrective 

25 actions are associated with analysis result cases. Corrective scripts must be installed in 
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each host prior to their use 702. The reasoning system analyzes data and determines the 
condition or problem that caused it 704. When the reasoning system matches patterns in 
the collected data with information in the knowledge base, it generates an analysis result 
that describes the problem or condition 706. The system manager then notifies an 
5 appropriate collector to run an appropriate corrective script 708. If the corrective script 
has been configured to be an automated corrective action, the script runs on the host with 
the identified problem or condition 710. If the corrective script has been configured to be 
a semi-automated corrective action, the management server requests a user's manual 
approval before executing the corrective script on the host with the identified problem or 
10 condition 712. 

Although the present invention has been described in detail with reference to 
certain preferred embodiments, it should be apparent that modifications and adaptations 
to those embodiments may occur to persons skilled in the art without departing firom the 
spirit and scope of the present invention as set forth in the following claims. 

15 
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What is claimed is; 
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1 . A system for managing business systems transactions and infrastructure, 
comprising: 

5 a collection system embedded in a client system, including one or more data 

collectors having at least one plug-in for extracting data from client system 
components, and a secure client connection to a communication network 
for transmitting the extracted data from the client aggregator and 
communicating over the network; and 

10 a management server including a system manager for controlling the 

management server, a secure server connection to the communication 
network for communicating over the network and receiving the extracted 
data from the client collection system, the system manager for comparing 
the received extracted data with threshold values, a repository for storing a 

15 knowledge base and the extracted data, a reasoning system for performing 

data analysis on the extracted data that exceed threshold values, and a 
notifications manager for notifying users of abnormal conditions based on 
the data analysis. 



20 2. The system of claim 1 , fiirther comprising; 

a client aggregator for receiving the extracted data from the data collectors; 
and 

a server aggregator for receiving the extracted data from the secure 
connection. 
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3. The system of claim 1, further comprising a corrective actions manager for 
activating corrective action scripts in client systems over the communication 
network. 



5 4. The system of claim 1 , further comprising a graphical user interface server 

connected to the secure server connection to the communication network and a 
web browser graphical user interface connected to the secure client connection to 
the communication network for communication with the graphical user interface 
server. 

10 

5. The system of claim 1, wherein; 

the collection system further comprises one or more configuration clients for 
receiving configuration commands for configuring the client system; and 
the management server further comprises a configuration manager for sending 
15 configuration commands to the collection system. 

6. The system of claim 1, wherein the data collector plug-in is selected from the 
group consisting of operating system plug-in, database plug-in, business process 
plug-in, web server plug-in, and application plug-in. 



20 



7. The system of claim 1, wherein the secure server connection and the secure client 
connection to the communication network are firewalls. 
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8, The system of claim 1 , wherein the management server further comprises a 
business process manager for analyzing and tracking client business system 
processes based on the extracted data. 

5 9. The system of claim 1 , wherein the management server further comprises a data 
processor for performing complex calculations. 

10. The system of claim 1, wherein the management server reasoning system 
comprises a diagnostic engine and a predictive analysis engine for analyzing the 

10 extracted data datasets and activating the notifications manager. 

1 1 . The system of claim 3, wherein the management server reasoning system 
comprises a diagnostic engine and a predictive analysis engine for analyzing the 
extracted data datasets and activating the corrective actions manager. 

15 

1 2. The system of claim 1, wherein the repository is an object oriented database 
management system. 

13. The system of claim 1, wherein the repository is a relational database management 
20 system. 

14. The system of claim 1, wherein the system components from which data are 
extracted is selected from the group consisting of a database host operating 
system, a database host database management system, a database host business 

25 process, a database host hardware components, a web host operating system, a 
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web host web server, a web host business process, a web host hardware 
components, an appHcation host operating system, an appHcation host application 
programs, an application host business process, and an application host hardware 
components. 

5 

15. The system of clmm 3, wherein the embedded collection system further includes 
corrective scripts associated with the data collectors that are activated by a server 
command from the corrective actions manager. 



10 1 6. The system of claim 1 , wherein the communication network is the Internet. 



17, A method for managing business systems transactions and infrastructure, 
comprising the steps of: 

extracting data from client system components by a data collector and data 
15 collector plug-ins; 

transmitting the extracted data through a secure client connection to a 

communication network; 
receiving the extracted data through a secure server connection to the 
communication network; 
20 comparing the extracted data by a system manager with threshold values 

stored in a repository; 
submitting the extracted data to a reasoning system if the extracted data 

exceeds a threshold value; 
analyzing the extracted data submitted to the reasoning system; and 
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notifying a user affected by the results of the reasoning system analysis by a 
notification manager. 



18. The method of claim 17, further comprising: 

5 aggregating the extracted data by a client aggregator; 

transmitting the extracted data by the client aggregator; and 
receiving the extracted data by a server aggregator. 

19. The method of claim 17, further comprising storing the received extracted data in 
10 the repository. 

20. The method of claim 17, further comprising automatically activating a corrective 
script in the client system by a corrective actions manager to correct a problem 
found by analyzing the extracted data, 

15 

2 1 . The method of claim 1 7, further comprising manually activating a corrective 
script in the client system by a corrective actions manager to correct a problem 
found by analyzing the extracted data. 

20 22. The method of claim 17, v^^herein the analyzing step comprises: 

detecting a problem from the extracted data by a diagnostic analysis engine; 
and 

associating the detected problem with a recommended solution found in a 
knowledge base stored in the repository, 

^ 25 
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23. The method of claim 17, wherein the analyzing step comprises: 

collecting the extracted data over time; 

detecting a trend in the collected extracted data by a predictive analysis 
engine; and 

5 estimating a time duration for a failure condition to occur. 

24. The method of claim 17, wherein the notifying step comprises: 

accessing a knowledge base in the repository to determine a user affected by 
the results of the reasoning system analysis; 
10 sending the results of the reasoning system analysis to a graphical user 

interface server; and 
transmitting the analysis results over the communication network to a client 
web browser graphical user interface for presentation to the affected user. 

15 25. The method of claim 17, wherein the notifying step is selected from the group 

consisting of transmitting an em^il message, sending a numeric page, and 
transmitting a text page. 



26. The method of claim 17, further comprising: 
20 transmitting a client system configuration change request from a client web 

browser graphical user interface through the secure client connection to the 
conununication network; 
receiving the configuration change request by a configuration server through 
the secure server connection to the communication network; 
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communicating the configuration change request to a configuration manager 

fi-om the graphical user interface server; 
validating the configuration change request and communicating a 

configuration change order to the graphical user interface server by the 
5 configuration manager; 

transmitting the configuration change order fi"om the graphical user interface 
server through the secure server connection to the conmiunication 
network; and 

receiving the configuration change order by a configuration client through the 
10 secure client connection to the communication network. 



27. A computer-readable medium containing instructions for controlling a computer 
system to carry out the steps of claim 17. 

15 28. A method for managing business systems transactions and infi^astructure, 
comprising the steps of: 

receiving extracted transactions and infirastructure data fi'om a business 
system; 

comparing the extracted data with threshold values stored in memory; 
20 analyzing the extracted data that exceeds threshold values; and 

notifying a user of a result of the analyzing step. 



29. The method of claim 28, further comprising activating a corrective script based on 
a result of the analyzing step. 

25 
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30. The method of claim 28, further comprising sending configuration commands to 
the business system. 



3 1 . The method of claim 28, fiirther comprising sending notifying information to a 
5 graphical user interface. 

32. A computer-readable medium containing instructions for controlling a computer 
system to cany out the steps of claim 28. 

10 33. A method for managing business systems transactions and infrastructure, 
comprising the steps of: 

extracting transactions and infrastructure data from a business system; and 
sending the extracted data to a system for comparing with thresholds, 

analyzing extracted data that exceeds thresholds, and notifying a user of a 
1 5 result of the analyzing step. 

34. The method of claim 33, further comprising activating corrective scripts in the 
business system. 

20 35, The method of claim 33, further comprising reconfiguring the business system 
upon receipt of a reconfiguration command. 

36. The method of claim 33, fiirther comprising receiving notification information and 
presenting the information on a graphical user interface. 

25 
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37. A computer-readable medium containing instructions for controlling a computer 
system to carry out the steps of claim 33. 
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